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Many models of physics beyond the Standard Model yield exotic Higgs decays. Some of these, 
particularly those in which the Higgs decays to light quarks or gluons, can be very difficult to 
discover experimentally. Here we introduce a new set of jet substructure techniques designed to 
search for such a Higgs when its dominant decay is into gluons via light, uncolored resonances. We 
study this scenario in both V + h and tt + h production channels, and find both channels lead to 
discovery at the LHC with > 5a at C ~ 100 fb" 1 . 



I. INTRODUCTION 



Discovering the Higgs boson is one of the main physics 
goals of the LHC program. While collider search strate- 
gies have been well developed for the Standard Model 
(SM) Higgs, the presence of new light degrees of free- 
dom can dramatically alter Higgs phenomenology. For 
instance, in a class of models with an extended Higgs sec- 
tor, the Higgs can decay via the cascade h — > 2a — » 4A, 
where a is an on-shell pseudoscalar and A is a SM state to 
which the pseudoscalar decays. When this decay domi- 
nates, the branching fractions into the standard discovery 
channels such as h — ► 77, tt, bb are very suppressed, and 
new LHC strategies have to be developed to discover the 
Higgs. Previous studies have considered the pseudoscalar 
decaying into 2b, 2r, 2/i, and 2j pQ. A more challenging 
case is when the pseudoscalar decays into light hadronic 
final states, as is predicted in the "buried Higgs" model 
[2] where the dominant decay is h — > 2a — > 4 gluons. 
A further motivation to consider this particular decay is 
that it is less constrained by existing LEP analyses and 
may allow a the Higgs mass well below 115 GeV [3]. 

Here we will introduce powerful new jet substructure 
techniques which enable the LHC to discover a Higgs 
whose dominant decay is to QCD-like jets via a light un- 
colored resonance. Specifically, we will consider the decay 
h — > 2a — > 4g where m h ~ 80 GeV-120 GeV and a is a 
pseudoscalar with m a < 10 GeV. At first sight discov- 
ering this Higgs using its dominant decay mode seems 
hopeless because the dominant Higgs production chan- 
nels are swamped by overwhelming QCD backgrounds. 
To make progress, we follow the strategy pioneered by 
Ref. [J] and consider the Higgs in a boosted regime. By 
going to this extreme kinematical limit we are able to 
substantially reduce the background to our signal. How- 
ever, the backgrounds are still considerable, and whereas 



Ref. [3] made use of 6-tagging to push the boosted Higgs 
into the discovery region, here the situation is more chal- 
lenging. 

Fortunately, these exotic Higgs decays have three 
features which can distinguish them from QCD back- 
grounds: (1) m a furnishes an additional light scale, (2) 
the Higgs decay is symmetric, as both a's have equal 
mass, and (3) both the Higgs and the a's are uncolored. 
We will find that the key to success lies in employing jet 
substructure tools sensitive to these characteristics. We 
will use these tools to devise a set of cuts that allows us 
to obtain more than ~ 5er signal significance at the LHC 
with = 14 TeV and C ~ 100 fb -1 . 

This paper is organized as follows. Sec. [H] discusses 
a model illustrating the type of exotic Higgs decay we 
wish to investigate. In Sec. |III[ we will introduce jet 
substructure tools designed to find this Higgs, which we 
will then employ in Sec. 
V + h and tt 
our conclusions. 



TV] to study the Higgs in the 
h production channels. Sec. [V] contains 



II. ILLUSTRATIVE MODEL 



Higgs decays into light jets occur in well-motivated 
theoretical frameworks. In the presence of a light pseu- 
doscalar particle a with cubic couplings to the Higgs, the 
Higgs can undergo the cascade decay h — > 2a — > 4 par- 
tons. This cascade was shown to be the generic decay 
mode in a class of models where the Higgs is a super- 
symmetric Goldstone boson [2 , and is also possible in 
extensions of the MSSM with an additional singlet su- 
perfield [5]. 

In the model of [5] the Higgs boson h has an effective 



2 



derivative interaction with the pseudoscalar a, 



£-ha 2 



(1) 



where v is the clcctrowcak scale, / is the global symmetry 
breaking scale, and c is a coefficient of order unity. As 
long as / is not much larger than the electroweak scale 
the decay h —¥ 2a dominates over the standard h — > bb 
mode. The pseudoscalar is not stable because it has 
Yukawa couplings to the SM fermions, iy^aip^^j. The 
largest Yukawa coupling is to the 3rd generation quarks, 
while it is suppressed for leptons and lighter quark gener- 
ations. Thus, for m a > 2mi, ~ 10 GeV the pseudoscalar 
decays almost exclusively into bottom quarks, resulting 
in the h — > 46 cascade. For m a < 2mt, the structure of 
the pseudoscalar Yukawa couplings means that the de- 
cay into two gluons via a loop of 3rd generation quarks 
dominates over tree level decays to (e.g.) 2r or 2c. The 
net result is a h — > 4g cascade decay occurring with a 
0.8 ~ 0.9 branching fraction. For this decay mode, the 
current limit on mn is only 86 GeV assuming the Higgs 
is produced with the SM cross section [3] For simplicity 
and clarity of presentation, in this paper we assume a 
100% Higgs branching fraction into four gluons. 

The production of buried Higgses at the LHC proceeds 
through similar vertices as in the SM. We shall assume 
here that the Higgs couples to the electroweak bosons 
and the top quarks with the same strength as in the SM, 
although in some models realizing the buried Higgs sce- 
nario these couplings may again be slightly modified. 



III. JET SUBSTRUCTURE TOOLS 

A buried Higgs is difficult to discover because its decay 
products are difficult to distinguish from ordinary QCD 
radiation. In the case at hand, because m a <C m^, the 
gluons from each a are very collimated, and so an un- 
boosted buried Higgs will be resolved as two jets. This 
will be very difficult to distinguish from the enormous 
backgrounds from QCD radiation. The extreme kine- 
matic configuration where the Higgs has a large pt, and 
is thus resolved entirely in one jet, is far more difficult for 
background processes to mimic. In this regime, the two 
jets from Higgs decay are themselves collimated into a 
single fat jet with a characteristic substructure. We will 
consider two such boosted scenarios, pp — > hW (adopting 
the basic kinematic cuts of Ref. [3]) and pp —> hti with a 
mildly boosted Higgs. 

The first step of our analyses is to cluster our events 
into relatively large jets and identify a candidate boosted 
Higgs jet. We then, along the lines of Ref. [H |6], use a 
cleaning procedure to remove contamination from pileup 
and underlying event from the jet and place a cut on 
its mass. To make further progress we must look to the 
distinguishing features of the exotic decays. 

One characteristic feature of the signal is that the jets 
from decays of light pseudoscalars a have small invariant 



masses, of order m a ;$ 10 GeV. This is clearly indepen- 
dent of the a's pr, while the invariant mass of a QCD jet 
grows with pr'. \J (m 2 j) ~ ^^ptR, where C = 3(4/3) 
for gluon (quark) initiated jets [7]. Because we work in 
the boosted regime where the a's have a large px, we 
expect the bulk of the QCD background subjets to have 
masses above 10 GeV. Thus, requiring that the average 
mass of the two hardest subjets be small (throughout we 
will denote the ith hardest subjet as jj), 



m{ji) + m(j 2 ) 



< 10 GeV, 



is an efficient way to separate signal from background. 

The signal events are also distinguished by the sym- 
metry of their decay products: both subjets arise from 
particles of equal mass. This can be distinguished by a 
cut on mass democracy: 



m(Ji) m(j 2 ) 
m(j 2 ) ' m{ji) 



(2) 



At the parton level a = 1 at leading order, while for the 
background there is no reason for the QCD radiation to 
produce democratic jets. 

Finally, signal and background events differ by their 
color structure [S]. For signal events color is only seen 
very late in the Higgs decay process: neither the Higgs 
nor the a's carry color charge QCD processes therefore 
only becomes operative only at the scale ~ 10 GeV af- 
ter the pseudoscalars decay into gluons. By contrast, 
the background jets are initiated by hard colored parti- 
cles, which are color-connected to the rest of the event; 
moreover, there is more phase space for QCD radiation. 
Therefore, we expect that the background has more ra- 
diation inside the fat jet cone than the signal does. We 
can quantify this intuition using the flow variable 



PT(j3) 



Pr{h) +Pt{]2 



(3) 



which is motivated by the fact that the signal is unlikely 
to yield radiation aside from that constituting the two 
collimated a's. We therefore expect the typical value of 
/3 for background processes to be much larger than for the 
signal. Before proceeding, we note that /3 can be sensi- 
tive to very soft radiation, depending on the cut one uses. 
Therefore, we employ j3 with a threshold: p™ ln and set 
j3 = for pxiiz) < p™ ln - Other flow variables could be 
defined to further boost discovery of the buried Higgs. In 
particular, since signal radiates less, a simple cut on the 
number of subjets above p™ ln ~ 1 GeV falling inside the 
fat jet cone adds more discriminating power. However 
we have not included this cut here because QCD predic- 
tions for the number of soft jets are not entirely reliable 
at the present stage. Measuring the number of tracks 
emanating from the leading subjets could also efficiently 
separate signal from background [SJ. 
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FIG. 1: Reconstructed mn = 100 GeV Higgs mass (left) in 
the V + h channel, after the cuts of Table [I] (excluding the cut 
on mii)', (right) in the ti+h channel, after the cuts of Table |H| 
(excluding the cut on tuh)- Error bars show statistical errors. 



IV. ANALYSIS 

Here we apply the substructure tools developed above 
to two processes yielding a boosted Higgs: pp —> hW 
and pp —¥ htt. Before proceeding with the analysis we 
describe our Monte Carlo tools and assumptions. 

We generate all signal and background events for htt 
at tree level using MadGraph v4 [TO] and shower them 
using Pythia 6.4.21 [TT]. We incorporate underlying 
event and pile-up using Pythia's "DW" tune and assum- 
ing a luminosity per bunch crossing of 0.05 mb . We 
generated signal samples for = 80, 100, 120 GeV and 
rn a = 8 GeV. Our tt+ jets sample is matched out to 
two jets using the fc^-MLM matching procedure [12] (our 
V+ jets sample requires no matching as it is dominated 
by 2 — > 2 processes). Jet clustering is performed using 
the anti-fcr algorithm [13] as implemented in Fast jet 
2.3 [Hj. When constructing subjets our procedure is to 
re-cluster the constituents of a jet using anti-fc-r with a 
smaller radius, denoted i? S ub- 



A. Discovering a buried Higgs in the V + h channel 

Here we consider a boosted Higgs recoiling against a 
vector boson as in Ref. 4 . As the production rate for 
pp — > hW is larger than pp — > hZ, and the branching 
ratio of W into leptons is much larger than that of Z into 
leptons, we will restrict ourselves to the process pp — > hW 
where W — > Iv for I = e,fi. 

Our events are clustered using jet radii R of 0.8, 1.0, 
and 1.2 for of 80, 100, and 120 GeV, respectively. 
To force ourselves into the boosted region we will con- 
sider events with a jet of pr > 200 GeV. The domi- 
nant background then is pp W + j. As one can see 
in Table [TJ the initial backgrounds are horrendous. De- 
manding that the average mass of the hardest two subjets 
(using i? su b = 0.3) lie below 10 GeV and requiring the 
trimmed |15j mass of the jet (using the trimming param- 
eter / cut = 0.03) lie within m-h ± 10 GeV helps, but it is 
not sufficient for a Higgs discovery. 



TABLE I: Cut efficiencies for a m h = 100 GeV Higgs in the 
pp — > hW channel using the procedure outlined in Sec. |IV A| 
At the end of the table we include results obtained using two 
different values of p™ m for j3. 





<?s lg (fb) 


o"6 9 (fb) 


S/B 


s/Vb 


p T (j) > 200 GeV 


16 


30000 


0.00052 


0.9 


subjet mass 


12 


19000 


0.00062 


0.9 


Hi£^s window 


7.1 


400 


0.018 


3.6 


a > 0.7 


4.1 


140 


0.030 


3.5 


p < 0.005, p™ in = 1 GeV 


0.67 


0.74 


0.90 


7.8 


< 0.005, p™ in = 5 GeV 


2.9 


2.6 


0.11 


5.7 



However, after cutting on the jet substructure variables 
a > 0.7 and (3 < 0.005, 0.005, and 0.007 for m h of 80, 
100, and 120 GeV, respectively, one finds a prominent 
signal, discoverable regardless of whether one uses p™ m = 
1 GeV or a more conservative 5 GeV. The Higgs mass 
distribution after these cuts is shown in Fig. [T] The final 
signal significances for the three Higgs masses we consider 
are shown in Table ITTT1 



B. Discovering a buried Higgs in the tt + h channel 

Here the signal process of interest is the associated 
production of a Higgs with a tt pair, followed by lep- 
tonic decays of both top quarks and Higgs decaying as 
h — > aa — > 4g. The final state consists of 2 6-tagged 
jets, 2 opposite-sign leptons, and (at least) 2 hard jets. 
The main background is ti+ jets, with secondary con- 
tributions from Z + bb and ttZ . Background processes 
with jets faking a lepton or a fo-jet are subleading. For 
the signal we use the SM NLO tiH cross-section [TB]; in 
particular <j t th ~ 1 pb for = 100 GeV. We use the 
NLO + NLL calculation of the inclusive tt+ jets cross- 
section to normalize the tt+ jets background [T71 [TB] . 
attj — 908 pb. The NLO cross-section for ttZ is much 
smaller, <j t tz — 1.1 pb |19] . 

Since the buried Higgs does not produce &-quarks in its 
decay, the combinatoric problems that contribute to the 
difficulty of using the tih channel in the SM are signifi- 
cantly ameliorated. In the dileptonic channel, there is in 
principle no combinatoric background: the decay prod- 
ucts of the top quarks can be cleanly separated from the 
decay products of the Higgs, much as in the W + h chan- 
nel. We first cluster particles using the anti-fcr algorithm 
with R su b = 0.4. To select for events containing 2 top 
quarks decaying leptonically we require two opposite-sign 
isolated leptons and two &-jets satisfying px, e > 15 GeV, 
p T ^ > 10 GeV, p T ,b > 20 GeV, \rfi,b\ < 2.5. We assume 
a flat b-tagging efficiency of 0.6. To control the Z + bb 
background we require that same-flavor leptons do not 
reconstruct a Z, \mu — m z\ > 10 GeV. After these cuts 
the cross-section for Z + bb is approximately 10% of the 
cross-section for dileptonic ti+ jets. The importance of 
Z + bb drops further relative to tt+ jets when kinematic 
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cr„ia (fb) 


(Tbg (fb) 


S/B 




preselection 


8.1 


6700 


0.001 


1.0 


p T (j) > 125 GeV 


3.1 


750 


0.004 


1.1 


Pt O2) > 40 GcV,m < 10 GeV 


0.58 


22 


0.03 


1.2 


m(j) = m h ± 10 GeV 


0.45 


3.9 


0.1 


2.3 


a > 0.7 


0.40 


2.0 


0.2 


2.9 


13 < 0.03, p™ in = 1 GeV 


0.28 


0.21 


1.3 


6.1 


13 < 0.03, p™ in = 5 GeV 


0.29 


0.25 


1.1 


5.7 



TABLE II: Cut efficiencies for a m h = 100 GeV Higgs in the 
tih channel using the procedure outlined in Sec. IV B. 



TABLE III: Final signal significance (S/\fB) and signal-to- 
background at C = 100 fb -1 for three different Higgs masses 
in the pp — > hW and pp — > hti channels. The numbers in 
parenthesis are the significance using m = 5 GeV for the ft 
cut, while those outside the parenthesis are for p™ m = 1 GeV. 





m h = 80 GeV 


m h = 100 GeV 


m h = 120 GeV 


pp -> hW S/V~B 


6.6 (4.8) 


7.8 (5.7) 


7.0 (6.9) 


S/B 


0.34 (0.067) 


0.90 (0.11) 


0.80 (0.24) 


pp -> hti S/VB 


6.1 (5.9) 


6.1 (5.7) 


7.1 (7.1) 


S/B 


1.1 (0.97) 


1.3 (1.1) 


2.5 (2.5) 



cuts are applied, and subsequently we neglect this con- 
tribution to the background. 

Next, we impose further selection criteria on the re- 
maining untagged jets. We take jets with pr > 10 GeV 
and further cluster them using the anti-fey algorithm into 
fat jets with R — 1.5. We then trim the fat jets by 
removing the contribution of R su b — 0.4 subjets with 
Pt < 0.15pTjat from the fat jets. We select events con- 
taining at least one fat jet with px > 125 GeV. 

The hardest fat jet is our Higgs candidate, and we 
apply to it similar kinematic and substructure cuts as in 
the W + h channel. We demand that the candidate jet 
contains at least 2 R su b = 0.4 subjets with pt > 40 GeV 
with the average mass of the hardest two subjets below 
10 GeV. Once again, at this stage bump-hunting for a fat 
jet in the to/j ± 10 GeV mass window is not enough for 
a discovery, and we need to cut on the jet substructure. 
Requiring a > 0.7 and (3 < 0.03 for PT,min = 1 GeV 
brings us well above the discovery level for mh ~ 100. 
The cut flow for rrih = 100 GeV is shown in table [TTJ 
and the invariant mass distribution of the fat jet mass 
after all cuts is shown in figjl] For = 120 GeV we 
need slightly harder kinematic cuts, pt{j) > 155 GeV, 
Pt{32) > 50 GeV, P < 0.06 to lift the significance above 
the discovery level. The final significance for all Higgs 
masses is given in table |III| 



V. CONCLUSIONS 

Here we have introduced a set of jet substructure tech- 
niques designed to discover a Higgs undergoing challeng- 
ing exotic decays. Remarkably, we found that these tools 
are sufficient to discover a Higgs whose dominant decay 
is to four gluons in both W + h and ti+ h channels after 
C ~ 100 fb - . While the systematic errors in both the 
background cross sections and the color flow cuts will 
need to be carefully studied, the comfortable values of 
S/B which we are able to obtain should ensure that dis- 
covery is possible. One further lesson is that the tt + h 
channel can be relatively more useful for a non-standard 
Higgs than it is in the SM. We believe that similar tech- 
niques can be applied to boost the LHC discovery poten- 
tial for a wider class of models where a light Higgs boson 
undergoes complex decays, e.g. h — > 4b or h — > 4r. 

These techniques demonstrate the potential for the 
LHC to probe qualitatively new scenarios of physics be- 
yond the SM as new jet substructure tools are devel- 
oped. One important point of our analysis is that a lot 
of discriminating power is contained in soft (a few GeV) 
QCD radiation. Further progress in detector sensitivity 
to soft radiation, as well as a better theoretical control 
over QCD predictions at the low invariant mass region 
of the spectrum could lead to further improvement in 
the discovery potential of non-standard Higgs bosons, or 
indeed to other non-standard new physics. 

Note added: When this work was finished Ref. 
appeared in which the same Higgs decay is studied with 
similar conclusions for the LHC discovery potential. 
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